On the categorization of Cause and Effect in WordNet
نویسندگان
چکیده
The task of detecting causal connections in text would benefit greatly from a comprehensive representation of Cause and Effect in WordNet, since previous studies show that semantic abstractions play an important role in the linguistic detection of semantic relations, in particular the cause-effect relation. Based on these studies on causality, and on our own general intuitions about causality, we propose a cover-set of different WordNet categories to represent the ontological classes of Cause and Effect. We also propose a corpus-based approach to the population of these categories, whereby candidate words and senses are identified in a large corpus (such as the Google N-gram corpus) using specific syntagmatic patterns. We describe experiments using the CauseEffect dataset from the 2007 SemEval workshop to evaluate the most effective combinations of WordNet categories and corpus data. Ultimately, we propose extending the WordNet category of Causal-Agent with the word-senses identified by this experimental exploration.
منابع مشابه
Automatic Construction of Persian ICT WordNet using Princeton WordNet
WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...
متن کاملThe Use of WordNets for Multilingual Text Categorization: A Comparative Study
The successful use of the Princeton WordNet for Text Categorization has prompted the creation of similar WordNets in other languages as well. This paper focuses on a comparative study between two WordNet based approaches for Multilingual Text Categorization. The first relates on using machine translation to access directly the princeton WordNet while the second avoids machine translation by usi...
متن کاملPunjabi WordNet Relations and Categorization of Synsets
This paper describes an attempt to develop Punjabi WordNet by using expansion approach from Hindi WordNet under Indradhanush WordNet Project. The origin, symbols, morphological and syntactic characteristics of Punjabi Language are presented in this paper. The lexical semantic relations used in Punjabi WordNet are elaborated. The need for synset categorization and the results of this categorizat...
متن کاملA New WordNet Enriched Content-Collaborative Recommender System
The recommender systems are models that are to predict the potential interests of users among a number of items. These systems are widespread and they have many applications in real-world. These systems are generally based on one of two structural types: collaborative filtering and content filtering. There are some systems which are based on both of them. These systems are named hybrid recommen...
متن کاملIntegrating a Lexical Database and a Training Collection for Text Categorization
Automatic text categorization is a complex and useful task for many natural language processing applications. Recent approaches to text categorization focus more on algorithms than on resources involved in this operation. In contrast to this trend, we present an approach based on the integration of widely available resources as lexical databases and training collections to overcome current limi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007